Page 1 of 7 
OIPE 





4 




6 




8 




9 




10 




12 




14 




15 




16 




17 




18 




19 




20 




2 2 




23 




24 




25 




26 




28 


c- - > 


29 


c--> 


30 




31 




37 




34 




35 




38 




39 




41 




42 




43 




44 




46 




47 




48 




51 




53 




54 




55 




56 




57 




59 




60 




65 




67 




69 




71 



RAW SEQUENCE LISTING DATE: 11/28/2001 

PATENT APPLICATION: US/09/922,6 8 3 TIME: 10:38:37 

Input Set : N:\Crf3\RULE60\09922683.txt 
Output Set: N:\CRF3\11282001\I922683.raw 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: DECKER, Heinrich 
(ii) TITLE OF INVENTION: ISOLATION OF THE BIOSYNTHESIS GENES FOR 

PSEUDO-OLICjaSACCHARIDES FROM STREPTOMYCES GLAUCESCENS 
. OJUJJ^ THEIR USE 




(iii) NUMBER OF SEQUENCES j 
(iv) CORRESPONDENCE ADDR* 

(A) ADDRESSEE: FOLEY & LARDNER 

(B) STREET: 3000 K Street, N.W. 

(C) CITY: Washington ^ ^ 

(D) STATE: D.C. fT <M HT W fc—fc P" \ J 

( E ) COUNTRY : U.S.A. | \g i ^ * u - 

(F) ZIP: 20007-5109 " 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC" DOS/MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/922 r 68 3 

(B) FILING DATE: 07-Aug-2001 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/194,905 

(B) FILING DATE: 1999-12-01 

(A) APPLICATION NUMBER: DE 19622783.6 

(B) FILING DATE: 07-JUN-1996 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Granados, Patricia D. 

(B) REGISTRATION NUMBER: 33,683 

(C) REFERENCE/DOCKET NUMBER: 026083/0193 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 672-5300 

(B) TELEFAX : (202) 672-5399 
(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CSGGSGSSGC SGGSTTCATS GG 22 
(2) INFORMATION FOR SEQ ID NO : 2: 
(i) SEQUENCE CHARACTERISTICS: 
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72 (A) LENGTH: 24 base pairs 

73 (B) TYPE: nucleic acid 

74 (C) STRANDEDNESS: single 

7 5 (D) TOPOLOGY: linear 

77 (ii) MOLECULE TYPE: other nucleic acid 

78 (A) DESCRIPTION: /desc = "Primer" 
83 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

8 5 GGGWVCTGGY VSGGSCCGTA GTTG 24 
87 (2) INFORMATION FOR SEQ ID NO : 3: 

89 (i) SEQUENCE CHARACTERISTICS: 

90 (A) LENGTH: 546 base pairs 

91 (B) TYPE: nucleic acid 

92 (C) STRANDEDNESS: single 

93 (D) TOPOLOGY: linear 

95 (ii) MOLECULE TYPE: DNA (genomic) 

100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

102 CCCGGGCGGG GCGGGGTTCA TCGGCTCCGC CTACGTCCGC CGGCTCCTGT CGCCCGGGGC 60 

104 CCCCGGCGGC GTCGCGGTGA CCGTCCTCGA CAAACTCACC TACGCCGGCA GCCTCGCCCG 120 

106 CCTGCACGCG GTGCGTGACC ATCCCGGCCT CACCTTCGTC CAGGGCGACG TGTGCGACAC 180 

108 CGCGCTCGTC GACACGCTGG CCGCGCGGCA CGACGACATC GTGCACTTCG CGGCCGAGTC 240 

110 GCACGTCGAC CGCTCCATCA CCGACAGCGG TGCCTTCACC CGCACCAACG TGCTGGGCAC 300 

112 CCAGGTCCTG CTCGACGCCG CGCTCCGCCA CGGTGTGCGC ACCCTCGTGC ACGTCTCCAC 360 

114 CGACGAGGTG TACGGCTCCC TCCCGCACGG GGCCGCCGCG GAGAGCGACC CCCTGCTCCC 420 

116 GACCTCGCCG TACGCGGCGT CGAAGGCGGC CTCGGACCTC ATGGCGCTCG CCCACCACCG 4 80 

118 CACCCACGGC CTGGACGTCC GGGTGACCCG CTGTTCGAAC AACTACGGCC CGCACCAGTT 540 

120 CCCGGG 546 

122 (2) INFORMATION FOR SEQ ID NO: 4: 

124 (i) SEQUENCE CHARACTERISTICS: 

125 (A) LENGTH: 541 base pairs 

126 (B) TYPE: nucleic acid 

127 (C) STRANDEDNESS: single 

128 (D) TOPOLOGY: linear 

130 (ii) MOLECULE TYPE: DNA (genomic) 

135 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

137 CCCCGGGTGC TGGTAGGGGC CGTAGTTGTT GGAGCAGCGG GTGATGCGCA CGTCCAGGCC 60 

139 GTGGCTGACG TGCATGGCCA GCGCGAGCAG GTCGCCCGAC GCCTTGGAGG TGGCATAGGG 120 

141 GCTGTTGGGG CGCAGCGGCT CGTCCTCCGT CCACGACCCC GTCTCCAGCG AGCCGTAGAC 180 

14 3 CTCGTCGGTG GACACCTGCA CGAAGGGGGC CACGCCGTGC CGCAGGGCCG CGTCGAGGAG 240 

14 5 TGTCTGCGTG CCGCCGGCGT TGGTCCGCAC GAACGCGGCG GCATCGAGCA GCGAGCGGTC 300 

147 CACGTGCGAC TCGGCGGCGA GGTGCACGAC CTGGTCCTGG CCGGCCATGA CCCGGTCGAC 360 

149 CAGGTCCGCG TCGCAGATGT CGCCGTGGAC GAAGCGCAGC CGGGGGTGGT CGCGGACCGG 420 

151 GTCGAGGTTG GCGAGGTTGC CGGCGTAGCT CAGGGCGTCG AGCACGGTGA CGACGGCGTC 480 

153 GGGCGGCCCG TCCGGACCGA GGAGGGTGCG GACGTAGTGC GAGCCCATGA ACCCCGCCGC 540 

155 C 541 

157 (2) INFORMATION FOR SEQ ID NO: 5: 
159 (i) SEQUENCE CHARACTERISTICS: 

1 fiO (A) LENGTH: 180 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: 
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163 (D) TOPOLOGY: linear 

165 (ii) MOLECULE TYPE: protein 

170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

172 Ala Ala Gly Phe Met Gly Ser His Tyr Val Arg Thr Leu Leu Gly Pro 

173 15 10 15 

175 Asp Gly Pro Pro Asp Ala Val Val Thr Val Leu Asp Ala Leu Ser Tyr 

176 20 25 30 

178 Ala Gly Asn Leu Ala Asn Leu Asp Pro Val Arg Asp His Pro Arg Leu 

179 35 40 45 

181 Arg Phe Val His Gly Asp lie Cys Asp Ala Asp Leu Val Asp Arq Val 

182 50 55 60 

184 Met Ala Gly Gin Asp Gin Val Val His Leu Ala Ala Glu Ser His Val 

185 65 70 75 80 

187 Asp Arg Ser Leu Leu Asp Ala Ala Ala Phe Val Arg Thr Asn Ala Gly 

188 85 90 95 

190 Gly Thr Gin Thr Leu Leu Asp Ala Ala Leu Arg His Gly Val Ala Pro 

191 100 105 110 

193 Phe Val Gin Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Glu Thr Gly 

194 115 120 125 

196 Ser Trp Thr Glu Asp Glu Pro Leu Arg Pro Asn Ser Pro Tyr Ala Thr 

197 130 135 140 

199 Ser Lys Ala Ser Gly Asp Leu Leu Ala Leu Ala Met His Val Ser His 

200 145 150 155 160 

202 Gly Leu Asp Val Arg lie Thr Arg Cys Ser Asn Asn Tyr Gly Pro Tyr 

203 165 170 175 

205 Gin His Pro Gly 

206 180 

208 (2) INFORMATION FOR SEQ ID NO : 6: 

210 (i) SEQUENCE CHARACTERISTICS: 

211 (A) LENGTH: 181 amino acids 

212 (B) TYPE: amino acid 

213 (C) STRANDEDNESS : 

214 (D) TOPOLOGY: linear 
216 (ii) MOLECULE TYPE: protein 

221 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

223 Pro Gly Gly Ala Gly Phe lie Gly Ser Ala Tyr Val Arg Arg Leu Leu 

224 15 10 15 

226 Ser Pro Gly Ala Pro Gly Gly Val Ala Val Thr Val Leu Asp Lys Leu 

227 20 25 30 

229 Thr Tyr Ala Gly Ser Leu Ala Arg Leu His Ala Val Arg Asp His Pro 

230 35 40 45 

232 Gly Leu Thr Phe Val Gin Gly Asp Val Cys Asp Thr Ala Leu Val Asp 

233 50 55 60 

235 Thr Leu Ala Ala Arg His Asp Asp He Val His Phe Ala Ala Glu Ser 

236 65 70 75 80 
238 His Val Asp Arg Ser He Thr Asp Ser Gly Ala Phe Thr Arg Thr Asn 
?39 85 90 95 

241 Val Leu Gly Thr Gin Val Leu Leu Asp Ala Ala Leu Arg His Gly Val 

242 100 105 110 
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244 Arg Thr Leu Val His Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Pro 

245 115 120 125 

247 His Gly Ala Ala Ala Glu Ser Asp Pro Leu Leu Pro Thr Ser Pro Tyr 

248 130 135 140 

250 Ala Ala Ser Lys Ala Ala Ser Asp Leu Met Ala Leu Ala His His Arg 

251 145 150 155 160 

253 Thr His Gly Leu Asp Val Arg Val Thr Arg Cys Ser Asn Asn Tyr Gly 

254 165 170 175 

256 Pro His Gin Phe Pro 

257 180 

259 (2) INFORMATION FOR SEQ ID NO : 7: 

261 (i) SEQUENCE CHARACTERISTICS: 

262 (A) LENGTH: 6854 base pairs 

263 (B) TYPE: nucleic acid 

264 (C) STRANDEDNESS : single 

265 (D) TOPOLOGY: linear 

267 (ii) MOLECULE TYPE: DNA (genomic) 

272 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



274 


CTGCAGGGTT 


CCCTGGTGCA 


CGACCCGCCC 


CTGGTCGACG 


ACCAGGGCGC 


TGTCGCAGAT 


60 


276 


CGCGGCGATG 


TCGGCGATGT 


CGTGGCTGGT 


GAGCACCACG 


GTGGTGCCCA 


GTTCCCGGTG 


120 


278 


GGCGCGGTTG 


ACCAGCCGGC 


GCACCGCGTC 


CTTCAGCACC 


ATGTCGAGGC 


CGATCGTGGG 


180 


280 


CTCGTCCCAG 


AACAGCACGG 


CCGGGTCGTG 


CAGCAGGCTC 


GCCGCGATCT 


CGGCGCGCAT 


240 


282 


GCGCTGTCCG 


AGGCTGAGCT 


GCCGCACGGG 


GGTGGACCCC 


AGCGCGTCGA 


TGTCGAGGAG 


300 


284 


GTCCCGGAAC 


AGGGCGAGGT 


TGCGCCGGTA 


GACCGGTCCG 


GGGATGTCGT 


AGATGCGGCG 


360 


286 


CAGGATGCGG 


AAGGAGTCGG 


GTACCGACAG 


GTCCCACCAG 


AGCTGGCTGC 


GCTGGCCGAA 


420 


288 


GACGACGCCG 


ATCGTGCGGG 


CGTTGCGCTG 


CCGGTGCCGG 


TAGGGCTCCA 


GCCCGGCGAC 


480 


290 


CGTGCAGCGG 


CCGGAGGTGG 


GGGTCATGAT 


GCCGGTCAGC 


ATCTTGATCG 


TGGTCGACTT 


540 


292 


GCCGGCTCCG 


TTGGCGCCGA 


TGTAGGCGGT 


CTTCGTGCCG 


GCCGGTATCT 


CGAAGGAGAC 


600 


294 


GTCGTCGACG 


GCGCGCACGA 


CGCGGTACCG 


GCGGGTCAGG 


AGGGTGGAGA 


GGCTGCCGAG 


660 


296 


CAGGCCGGGC 


TCGCGTTCGG 


CCAGCCGGAA 


CTCCTTGACG 


AGGTGTTCGG 


CCACGATCAC 


720 


298 


GCGATCACCC 


GCTCGACGGC 


CGTCTCCAGC 


AGGCGCAGGC 


CCTCGTCGAG 


CAGCGCCTCG 


780 


300 


TCGAGGGTGA 


ACGGCGGTGC 


CAGCCGCAGG 


ATGTGGCCGC 


CCAGGGAGGT 


GCGCAGCCCC 


840 


302 


AGGTCGAGGG 


CGGTGGTGTA 


GACGGCCCGG 


GCGGTCTCGG 


GGGCGGGTGC 


CCGGCCGACG 


900 


304 


GCGTCGGTGA 


CGAACTCCAG 


GCCCCACAGC 


AGTCCGAGGC 


CGCGTACCTG 


GCCGAGCTGG 


960 


306 


GGGAAGCGGG 


ACTCCAGGGC 


GCGCAGCCGC 


TCCTGGATGA 


GCTCGCCGAG 


GACGCGCACG 


1020 


308 


CGGTCGATCA 


GCCGGTCGCG 


CTCGACGACC 


TCCAGCGTGG 


CGCGGGCGGC 


GGCGATCCCC 


1080 


310 


AGTGGGTTGC 


TCGCGTACGT 


CGAGGCGTAC 


GCCCCGGGGT 


GGCCGCCTCC 


GGCCTGCGCA 


1140 


312 


GCTTCCGCGC 


GTCCGGCCAG 


CACGGCGAAG 


GGGAATCCGC 


TCGCGGTGCC 


CTTGGACAGC 


1200 


314 


ATCGCCAGGT 


CCGGCTCGAT 


GCCGAACAGT 


TCGCTGGCGA 


GGAAGGCGCC 


GGTGCGCCCG 


1260 


316 


CCGCCGGTGA 


GGACCTCGTC 


GGCGACGAGC 


AGCACGCCGC 


CGTCCCGGCA 


GGCGCCGGCG 


1320 


318 


ATCCGCTCCC 


AGTAGCCGGG 


GGGCGGCACG 


ATGACGCCTG 


CCGCGCCGAG 


GACGGGTTCG 


1380 


320 


AAGACCAGGG 


CCGAGACGTT 


GGGCTTCTCC 


GCGATGTGCC 


GGCGCACGAG 


GGTCGCGCAC 


1440 


3 2 2 


CGCACGTCGC 


ACGAGGGGTA 


CTCCAGGCCC 


AGGGGACAGC 


GGTAGCCAGT 


AGGGGCTGTA 


1500 


324 


GCCAGCACGC 


TGTTGCCGCT 


GAAGGCCTGG 


TGGCCGATGT 


CCCAGTGGAC 


CAGCATCCGG 


1560 


326 


GCGCCCATGG 


TCTTGCCGTG 


GAAGCCGTGG 


CGCAGGGCGC 


AGATCCGGTT 


GCGGCCCGGC 


1620 


328 


GCGGCGGTCG 


CCTGGACGAC 


CCGCAGGGCG 


GCCTCGACCA 


CCTCCGCGCC 


GGTGGAGAAG 


1680 


3 3 0 


A A G G C G T A G G 


TGTCGAGCTC 


TTCCGCCACC 


a ^^^TCCCCA 


r* r> T\ r"T T r» n * r> 




i n a r\ 


332 


CGGTCCGGCG 


TGGCGCTGTC 


GTGGACGTTC 


CACAGGCGGC 


GGGCCTGGGT 


GGTGAGTGCC 


1800 


334 


TCGACGACCT 


CCGGGTGCCC 


GTGGCCCAGT 


GACTGGGTGA 


GGGTCCCGGC 


CGCGAAGTCG 


1860 
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336 AGGTACTGGT TGCCGTCCAG GTCGGTCAGA ACGGGACCGC GTCCCTCGGC GAAGACCCGG 

338 CGTCCGTGGA CGGCTTCCTC GGAGGCGCCC GGCGCCAGGT GGCGGGCCTC CCGTGCCAGG 

340 TGCTGTGTCT GCCGTAAGCC TGTCATCGCT GCCTCTGCTC GTCGGACCGG CTGACGCGAT 

342 CGCCGGCGAA CTGCGTTGTG GCGCACCACG GTTGGGGCGG CTCGGCGCTG AGTCAAACAC 

344 TTGAACACAC ACCGCTGCAA GAGTTTGCGG GTTGTTTCAG AAAGTTGTTG CGAGCGGCCC 

346 CGGCACTCTG GTTGAGTCGA CGTGCTTACG GCGCCACCAC GCCTCACGTT CGAGGAGGGA 

348 CCTGTGAGAA CAAGCCCGCA GACCGACCCG CTCCCGCGGA GGCCGAGGTG AAGGCCCTGG 

3 50 TCCTGGCAGG TGGAACCGGC AGCAGACTGA GGCCGTTCAC CCACACCGCC GCCAAGCAGC 

3 52 TGCTCCCCAT CGCCAACAAG CCCGTGCTCT TCTACGCGCT GGAGTCCCTC GCCGCGGCGG 

3 54 GTGTCCGGGA GGCCGGCGTC GTCGTGGGCG CGTACGGCCG GGAGATCCGC GAACTCACCG 

3 56 GCGACGGCAC CGCGTTCGGG TTACGCATCA CCTACCTCCA CCAGCCCCGC CCGCTCGGTC 

3 58 TCGCGCACGC GGTGCGCATC GCCCGCGGCT TCCTGGGCGA CGACGACTTC CTGCTGTACC 

360 TGGGGGACAA CTACCTGCCC CAGGGCGTCA CCGACTTCGC CCGCCAATCG GCCGCCGATC 

362 CCGCGGCGGC CCGGCTGCTG CTCACCCCGG TCGCGGACCC GTCCGCCTTC GGCGTCGCGG 

364 AGGTCGACGC GGACGGGAAC GTGCTGCGCT TGGAGGAGAA ACCCGACGTC CCGCGCAGCT 

366 CGCTCGCGCT CATCGGCGTG TACGCCTTCA GCCCGGCCGT CCACGAGGCG GTACGGGCCA 

368 TCACCCCCTC CGCCCGCGGC GAGCTGGAGA TCACCCACGC CGTGCAGTGG ATGATCGACC 

370 GGGGCCTGCG CGTACGGGCC GAGACCACCA CCCGGCCCTG GCGCGACACC GGCAGCGCGG 

372 AGGACATGCT GGAGGTCAAC CGTCACGTCC TGGACGGACT GGAGGGCCGC ATCGAGGGGA 

374 AGGTCGACGC GCACAGCACG CTGGTCGGCC GGGTCCGGGT GGCCGAAGGC GCGATCGTGC 

376 GGGGGTCACA CGTGGTGGGC CCGGTGGTGA TCGGCGCGGG TGCCGTCGTC AGCAACTCCA 

37 8 GTGTCGGCCC GTACACCTCC ATCGGGGAGG ACTGCCGGGT CGAGGACAGC GCCATCGAGT 

3 80 ACTCCGTCCT GCTGCGCGGC GCCCAGGTCG AGGGGGCGTC CCGCATCGAG GCGTCCCTCA 

3 82 TCGGCCGCGG CGCCGTCGTC GGCCCGGCCC CCCGTCTCCC GCAGGCTCAC CGACTGGTGA 

384 TCGGCGACCA CAGCAAGGTG TATCTCACCC CATGACCACG ACCATCCTCG TCACCGGCGG 

386 AGCGGGCTTC ATTCGCTCCG CCTACGTCCG CCGGCTCCTG TCGCCCGGGG CCCCCGGCGG 

388 CGTCGCGGTG ACCGTCCTCG ACAAACTCAC CTACGCCGGC AGCCTCGCCC GCCTGCACGC 

390 GGTGCGTGAC CATCCCGGCC TCACCTTCGT CCAGGGCGAC GTGTGCGACA CCGCGCTCGT 

392 CGACACGCTG GCCGCGCGGC ACGACGACAT CGTGCACTTC GCGGCCGAGT CGCACGTCGA 

394 CCGCTCCATC ACCGACAGCG GTGCCTTCAC CCGCACCAAC GTGCTGGGCA CCCAGGTCCT 

396 GCTCGACGCC GCGCTCCGCC ACGGTGTGCG CACCTTCGTG CACGTCTCCA CCGACGAGGT 

398 GTACGGCTCC CTCCCGCACG GGGCCGCCGC GGAGAGCGAC CCCCTGCTTC CGACCTCGCC 

400 GTACGCGGCG TCGAAGGCGG CCTCGGACCT CATGGCGCTC GCCCACCACC GCACCCACGG 

402 CCTGGACGTC CGGGTGACCC GCTGTTCGAA CAACTTCGGC CCCCACCAGC ATCCCGAGAA 

404 GCTCATACCG CGCTTCCTGA CCAGCCTCCT GTCCGGCGGC ACCGTTCCCC TCTACGGCGA 

406 CGGGCGGCAC GTGCGCGACT GGCTGCACGT CGACGACCAC GTCAGGGCCG TCGAACTCGT 

408 CCGCGTGTCG GGCCGGCCGG GAGAGATCTA CAACATCGGG GGCGGCACCT CGCTGCCCAA 

410 CCTGGAGCTC ACGCACCGGT TGCTCGCACT GTGCGGCGCG GGCCCGGAGC GCATCGTCCA 

412 CGTCGAGAAC CGCAAGGGGC ACGACCGGCG CTACGCGGTC GACCACAGCA AGATCACCGC 

414 GGAACTCGGT TACCGGCCGC GCACCGACTT CGCGACCGCG CTGGCCGACA CCGCGAAGTG 

416 GTACGAGCGG CACGAGGACT GGTGGCGTCC CCTGCTCGCC GCGACATGAC GTCGGGCCGG 

418 ACCGCAACCA CCGGCCCCGG CCGGCACACC GCCGCCCGCG GCCGGTGGCC GGCCGGTCAG 

420 CGTCCGTGAG CCGGGCGCCG GCCGCCCCGC GGGCCGGCGG CGGTGGACCC CCGGACCACC 

422 AGTTCCGGCA TGAAGACGAA TTCGGTGCGC GGCGGCGGCG TTCCGCTCAT CTCCTCCAGC 

424 AGTGCGTCCA CGGCGACCTG CCCCATCGCC TTGACGGGCT GTCTGATGGT GGTCAGGGGA 

426 GGGTCGGTGA AGGCCATGAG CGGCGAGTCG TCGAAGCCGA CCACCGAGAT GTCACCGGGA 

428 AC.C.GTCAGAC CCCGCCGGCG CGCGGCCCGC ACGGCGCCCA CGGCCATCAT GTCGCTGGCG 

430 CACATGACGG CGGTGCAGCC CAGGTCGATC AGCGCGGACG CGGCGGCCTG GCCCCCCTCC 

432 AGGGAGAACA GCGAGTGCTG CACGAGCTCC TCGGACTCCC GCGCCGACAC TCCCAGGTGC 
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